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Coding of stereo signals 
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This invention relates to the coding of multichannel signals including at least a 
first and a second signal component. More particularly, the invention relates to the coding of 
multiphonic audio signals, such as stereophonic signals. 



5 

Stereophonic audio signals comprise a left (L) and a right (R) signal 
component which may originate from a stereo signal source, for example from separated 
microphones. The coding of audio signals aims at reducing ttie bit rate of a stereophonic 
signal, e.g. in order to allow an eflScient transmission of sound signals via a communications 

10 network, such as the Internet, via a modem and analogue telephone Unes, mobile 

commimication channels or other a wireless networks, etc., and to store a stereophonic sound 
signal on a chip card or another storage medium with limited storage capacity. 

US patent no. 6,121,904 discloses a compressor for compressing digital audio 
signals comprising corresponding predictors for the left and right stereo channels. The 

1 5 predictor for the left chaimel receives a current sample and previous samples of the left audio 
signal as well as the current and previous samples of the right audio signal and produces a 
predicted next sample of the left signal. Similarly, the predictor for the right channel receives 
a current sample and previous samples of the right audio signal as well as the current and 
previous samples of the left audio signal and produces a predicted next sample of the right 

20 signal. 

It is an object of the present invention to provide a method of and an 
arrangement for coding multichannel signals with a low bit rate. 



25 



The above and other objects are achieved by a method of encoding a 
multichaimel signal including at least a first signal component and a second signal 
component, the method comprising the steps of 
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- detennining a set of filter parameters of a prediction filter such that the prediction filter 
provides an estimate of the second sig^al component when receiving the first signal 
componmt as an input; and 

- representing the multichannel signal as the first signal component and the set of filter 
5 parameters. 

Consequently, by encoding the multichannel signal as a first signal component 
and a set of fiUer parameters, the multichaimel signal is encoded with a bit rate which is only 
sUghtly higher than that of a single channel, e.g. a mono channel. The resulting encoded 
signal may be stored and/or commimicated to a receiver. The invention is based on the 

1 0 recognition that for many multichamiel signals one signal component may be predicted fix>m 
at least one other chaimel of the mxiltichannel signal by an adaptive filter process. 
Consequently, when the determined filter parameters are commimicated to a decoder, the 
multichannel signal may be retrieved on the basis of the first signal component and the filter 
parameters, allowing the decoder to model the second signal component. 

1 5 The term multichannel signal comprises any signal including two or more 

interrelated signal components. Examples of such signals include multiphonic audio signals, 
such as stereophonic signals, or the like, comprising synchronised recordings of the same 
audio presentation. According to some embodiments of the invention the multichannel signal 
comprises transformed signal components of a multichannel soxirce signal, e.g. transfomied 

20 stereophonic signal components generated by transforming the L and R stereo signals into a 
transformed set of signals which may be better suited for the modelling of one signal 
component by another according to tiie invention. Further examples of multi-channel signals 
include signals received from a Digital Versatile Disc (DVD) or a Super Audio Compact 
Disc, etc. 

25 In a preferred embodiment of the invention, the step of detOTnining the set of 

filter parameters comprises the step of determining the filter parameters such that a difference 
of the second signal component and the estimated signal componrat is smaller than a 
predetermined value. When the difference between ttie modelled signal and the second signal 
component is small, the modelled signal provides a good estimate of the second signal 

30 component. Hence, a measure of quality is provided for the modelling of the second signal 
component, thereby ensuring that the coding process according to the invention provides a 
minimuTn reduction in quality, e.g. in the example of stereo audio signals minimmn audible 
distortions of the signal. 
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According to a further preferred embodiment of the invention, the step of 
representing the multichamiel signal as the first signal component and the set of filter 
parameters further comprises the step of representing the multichannel signal as the first 
signal component, the set of filter parameters, and an error signal indicative of the difference 
5 of the second signal component and the estimated signal component, if said difference is not 
smaller than said predetermined value. 

Hence, if the estimated signal provided by the step of filtering does not model 
the second signal component sufficiently well, the error signal is included in the encoded 
signal, thereby providing the decoder with additional information. The decoder may combine 
10 the predicted signal with the received error signal, thereby achieving a good approximation of 
the second signal component. The bit rate used for communicating the error signal maybe 
varied, e.g. according to the bandwidth available for a coixmiunication link at a given time. 
Hence, it is an advantage of the invention that it provides the possibiUty for a trade-ofif 
between the bit rate used for communicating the signal and the signal quality at the receiver. 
15 Therefore, a mechanism for graceful degradation is provided, e.g. by adaptively increasing or 
decreasing the bit rate allowed for the error signal. 

In another preferred embodiment of the invention, the method further 
comprises the step of transforming at least a first soxirce signal component and a second 
source signal component of a multichannel soxurce signal into the first and second signal 
20 components. Consequently the first and second signal components are respective 

combinations of the first and second source signal components, thereby providing an input 
signal to the prediction filter which may be better suited for predicting the second signal 
component as the corresponding source signals. Examples of transformations include linear 
combinations of the first and second source signals, for example, in the case of stereophonic 
25 audio signals the combinations L+R and L-R. Further examples include rotations in sigoal 
space and other transformations. The transformation may be parameterised by transformation 
parameters which may be fixed or adaptive, i.e. fliey may be adapted according to properties 
of the source signal. 

In a further preferred embodiment of the invention, 
30 - said first signal component is a principal component signal of a source multichannel 

signal including a nmnber of source signal compon^ts and the second signal component 
is a corresponding residual signal; 
- the metiiod furtho: comprises the step of transfoiming at least the first and second source 
signal components by a predetermined transformation into the principal component signal 
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including most of the signal enCTgy and at least the residual sigaal including less energy 
than the principal component signal, the predetermined transformation being 
parameterised by at least one transformation parameter; and 
- the step of representing the multichannel signal as the first signal component and the set 
5 of filter parameters further comprises the step of representing the multichannel signal as 

the principal component signal, the set of filter parameters, and the transformation 
parameter. 

Hence, according to this embodiment, the multichannel signal is represented 
by the principal signal, the transformation parameter, and flie set of filter parameters allowing 

10 the receiver to model the small residual signal, thereby improving the coding efficiency for 
the multichannel signal. This embodiment is based on the recognition that for many 
mxiltichannel signals, e.g. in the case of audio signals for music and speech signals, the 
residual signal may accurately be estimated as a filtered version of the principal signal. It is 
therefore an advantage of this embodiment that it provides a particularly efficient method of 

1 5 encoding which preserves a high level of quality. 

Preferably, the optimal transformation parameter may continuously be tracked, 
thereby ensuring the transformation remains optimal even if the characteristics of the input 
signal changes, e.g. in the example of an audio signal due to a moving sound source or 
changes in acoustic properties of the environment. 

20 When the predetermined transformation is a rotation and the transformation 

parameter corresponds to an angle of rotation, a simple transformation is provided based only 
on a single parameter, the angle of rotation. By adapting the angle such that the signal 
components, e.g. the L and R signal components of a stereo signal, are rotated into a principal 
component signal and a residual signal, an efficient coding is provided while maintaining a 

25 higih quality signal. 

It is an advantage of the invention that it provides an efficient bit-rate 
utilisation, i.e. a coding scheme which uses a low bit rate for a given sound quality. The 
coding scheme according to the invention may be used to reduce the bit rate without 
significantly reducing the sound quaUty, to maintain the bit rate while improving the sound 

30 quality, or a combination of the above. 

In a preferred embodiment of the invention, the step of deterniiiiing a set of 
filter parameters fiirther comprises the step of determining at least one scaling parameter 
(PuPi) for scaling the estimate of the second signal component such that a measure of 
correlation between the second signal component and the estimate of the second signal 
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component is increased. Consequently, a measure of similarity between the estimated and the 
actual signal is optimised, thereby further improving the quality of the coded signal. 

The invention further relates to a method of decoding multichannel signal 
infonnation, the method comprising the steps of 
5 - receiviag a first signal component and a set of filter parameters; 

estimating a second signal component using a prediction filter corresponding to the 
received set of filter parameters, the prediction filter receiving the received first signal 
component as an input. 

The present invention can be implemented in different ways including the 
10 metiiods described above and in the following, arrangements for encoding and decoding 
multichannel signals, respectively, a data signal, and further product means, each yielding 
one or more of the benefits and advantages described in connection with the first-mentioned 
method, and each having one or more preferred embodiments corresponding to the preferred 
embodiments described in connection with the first-mentioned method and disclosed in the 
1 5 dependant claims. 

It is noted that the features of the methods described above and in the 
following may be implemented in software and carried out in a data processing system or 
other processing means caused by the execution of computer-executable instructions. The 
instructions may be program code means loaded in a memory, such as a RAM, firom a storage 
20 medium or firom another computer via a computer network. Alternatively the described 
features may be implemented by hardwired circuitry instead of software or in combination 
with software. 

The invention further relates to an arrangement for encoding a multichannel 
signal including at least a first signal component and a second signal component the 
25 arrangement comprising 

- a prediction filter for estimating the second signal component, the prediction filter 
corresponding to a set of filter parameters and receiving the first signal component as an 
input; and 

- processing means for representing the multichannel signal as the first signal component 
30 and the set of filter parameters. 

The invention fiirther relates to an arrangement for decoding a multichannel 
signal corresponding to at least two signal components, the arrangement comprising 

- receiving means for receiving a fiarst signal component of the multichannel signal and a 
set of filter parameters; 
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- a prediction filter for estimating a second signal conoiponent of the multichannel signal, 
the prediction filter receiving the received set of filter parameters and the received first 
signal component as an input. 

The above arrangements may be part of any electronic equipment including 
computers, such as stationary and portable PCs, stationary and portable radio 
communications equipment and other handheld or portable devices, such as mobile 
telephones, pagers, audio players, multimedia players, communicators, i.e. electronic 
organisers, smart phones, personal digital assistants (PDAs), handheld computers, or the like. 

The term processing means comprises general- or special-purpose 
programmable microprocessors. Digital Signal Processors (DSP), AppUcation Specific 
Integrated Circuits (ASIC), Programmable Logic Arrays (PLA), Field Programmable Gate 
Arrays (FPGA), special purpose electronic circuits, etc., or a combination thereof. The above 
first and second processing means may be separate processing means or they may be 
comprised in one processing means. 

The term receiving means includes circuitry and/or devices suitable for 
enabUng the communication of data, e.g. via a wired or a wireless data link. Examples of 
such receiving means include a network interface, a network card, a radio receiver, a receiver 
for other suitable electromagnetic signals, such as infrared light, e.g. via an IrDa port, radio- 
based communications, e.g. via Bluetooth transceivers, or the like. Further examples of such 
receiving means include a cable modem, a telephone modem, an Integrated Services Digital 
Network (ISDN) adapter, a Digital Subscriber Line (DSL) adapter, a sateUite transceiver, an 
Ethernet adapter, or the like. 

The term receiving means fiirther comprises other input circuits/devices for 
receiving data signals, e.g. data signals stored on a computer-readable medium. Examples of 
such receivmg means include a floppy-disk drive, a CD-Rom drive, a DVD drive, or any 
other suitable disc drive, a memory card adapter, a smart card adapter, etc. 

The invention finther relates to a data signal including multichaimel signal 
information, the data signal being generated by a method described above and in the 
following. The signal may be embodied as a data signal on a carrier wave, e.g. as a data 
signal transmitted by communications means as described above and in the following. 

The invention further relates to a computer-readable medium comprising a 
data record indicative of multichaimel signal information generated by a method described 
above and in the following. The term computer-readable medium comprises magnetic tape, 
optical disc, digital video disk (DVD), compact disc (CD or CD-ROM), mini-disc, hard disk. 
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floppy disk, ferro-electric memory, electrically erasable programmable read only memory 
(EEPROM), flash mraiory, EPROM, read only memory (ROM), static random access 
memory (SRAM), dynamic random access memory (DRAM), synchronous dynamic random 
access memory (SDRAM), ferromagnetic memory, optical storage, charge coupled devices, 
5 smart cards, PCMCIA card, etc. 

The invention further relates to a device for communicating a multichannel 
signal, the device comprising an arrangement for encoding the multichannel signal as 
described above and in the following. 

10 

These and other aspects of the invention will be apparent from and elucidated 
with reference to the embodiments and with reference to the drawing, in which: 

fig. 1 shows a schematic view of a system for communicating stereo signals 
according to an embodiment of the invention; 
15 fig. 2 shows a schematic view of an arrangement for encoding a multichannel 

signal according to a first embodiment of the invention; 

fig. 3 shows a schematic view of an arrangement for decoding a multichannel 
signal according to the first embodiment of the invention; 

fig. 4 shows a schematic view of an arrangement for encoding a stereo signal 
20 according to a second embodiment of the invention; 

fig. 5 illustrates the determination of the signal transformation according to an 
embodiment of Ihe invention; 

fig. 6 shows a schematic view of an arrangement for decoding a stereo signal 
according to the second embodiment of the invention; 
25 figs. 7a-c show schematic views of examples of a filter circuit for use in an 

embodiment of the invention; 

fig. 8 shows a Schematic view of an arrangement for encoding a stereo signal 
according to a third embodiment of the invention; 

fig. 9 shows a schematic view of an arrangement for encoding a stereo signal 
30 according to a fourth embodiment of the invention; 

fig. 10 shows a schematic view of an arrangement for decoding a stereo signal 
according to the fourth embodimCTit of the invention; 

fig. 1 1 shows a schematic view of an arrangement for encoding a multichannel 
signal according to a fifth embodiment of the invention; and 
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fig. 12 shows a schematic view of a subtraction circuit for use with au 
embodiment of the invention. 



5 Fig. 1 shows a schematic view of a system for communicating stereo signals 

according to an embodiment of the invention. The system comprises a coding device 101 for 
generating a coded stereophonic signal and a decoding device 105 for decoding a received 
coded signal into a stereo L signal and a stereo R signal component. The coding device 101 
and the decoding device 105 each may be any electronic equipment or part of such 

10 equipment. Here the term electronic equipment comprises computers, such as stationary and 
portable PCs, stationary and portable radio communication equipment and other handheld or 
portable devices, such as mobile telephones, pagers, audio players, multimedia players, 
communicators, i.e. electronic organisers, smart phones, personal digital assistants (PDAs), 
handheld computers, or the like. It is noted that the coding device 101 and ttie decoding 

15 device may be combined in one electronic equipment where stereophonic signals are stored 
on a computer-readable medium for later reproduction. 

The coding device 101 comprises an encoder 102 for encoding a stereophonic 
signal according to the invention, the stereophonic signal including an L signal component 
and an R signal component. The encoder receives the L and R signal components and 

20 generates a coded signal T. The stereophonic signal L and R, may originate from a set of 
microphones, e.g. via further electronic equipment, such as a mixing equipment, etc. The 
signals may further be received as an output from another stereo player, over-the-air as a 
radio signal, or by any other suitable means. Preferred embodiments of such an encoder 
according to the invention will be described below. According to one embodiment, the 

25 encoder 102 is connected to a transmitter 103 for transmitting the coded signal T via a 

communications channel 109 to the decoding device 105. The transmitter 103 may comprise 
circuitry suitable for enabling the communication of data, e.g. via a wired or a wireless data 
link 109. Examples of such a transmitter include a network interface, a network card, a radio 
transmitter, a transmitter for other suitable electromagnetic signals, such as an LED for 

30 transmitting infrared Ught, e.g. via an IrDa port, radio-based communications, e.g. via a 
Bluetooth transceiver, or the like. Further examples of suitable transmitters include a cable 
modem, a telephone modem, an Integrated Services Digital Network (ISDN) adapter, a 
Digital Subscriber Line ^SL) ad^ter, a satelUte transceiver, an Ethernet adapter, or the like. 
Correspondingly, the communications channel 109 maybe any suitable wired or wireless 
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data link, for example of a packet-based commimications network, such as the Ihtemet or 
another TCP/IP network, a short-range communications link, such as an infrared link, a 
Bluetooth cotmection or anotiier radio-based link. Further examples of the communications 
channel include computer networks and wireless telecommunications networks, such as a 

5 Cellular Digital Packet Data (CDPD) network, a Global System for Mobile (GSM) network, 
a Code Division Multiple Access (CDMA) network, a Time Division Multiple Access 
Network (TDMA), a General Packet Radio service (GPRS) network, a Third Generation 
network, such as a UMTS network, or the like. Alternatively or additionally, the coding 
device may comprise one or more other interfaces 104 for communicating the coded stereo 

1 0 signal T to the decoding device 105. Examples of such interfaces include a disc drive for 
storing data on a computer-readable medium 1 10, e.g. a floppy-disk drive, a read/write CD- 
ROM drive, a DVD-drive, etc. Other examples include a memory card slot a magnetic card 
reader/writer, an interface for accessing a smart card, etc. Correspondingly, the decoding 
device 105 comprises a corresponding receiver 108 for receiving tiie signal transmitted by the 

15 transmitter and/or another interface 106 for receiving tiie coded stereo signal communicated 
via the interface 104 and the computer-readable medium 110. The decoding device further 
comprises a decoder 107 which receives the received signal T and decodes it into 
corresponding stereo components L* and R'. Preferred embodiments of such a decoder 
according to the invention will be described below. The decoded signals L' and R' may 

20 subsequently be fed into a stereo player for reproduction via a set of speakers, head-phones, 
or the like. 

Fig. 2 shows a schematic view of an arrangement for encoding a multichannel 
signal according to a first embodiment of the invention. According to fliis embodiment, the 
multichannel signal comprises two components Si and Sa- The arrangement comprises an 

25 adaptive filter 201 receiving the signal component Si as an input and generating a filtered 
signal §2. The filter parameters Fp of the adaptive filter are selected such that the filtered 
signsd §2 ^proximates the second signal component S2, e.g. by controlling the adaptive filter 
201 by the error signal e indicating the difference between S2 and §2 as generated by a 
subtraction circuit 203. The filter 201 may be any suitable filter known in the art. Examples 

30 of such filters include a finite impulse response (FIR) filter or a infinite impxxlse response 
(IIR) filter, adaptive or fixed, with the cut-off fi-equencies and magnitudes being fixed or 
tracked recursively, or the like. The filter may be of any order, preferably smaller than 10. 
The type of the filter can be Butterworth, Chebychev, or any other suitable type of filter. In 
the example of audio signals, examples of such adaptive filters include an ad£^tive filter 
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known from the field of echo cancellation, or a filter based on a psychoacoustic model of the 
human auditory system, e.g. as is known from MPEG coding, thereby reducing the number of 
filter parameters. According to another embodiment the filter may fiirther be simplified, e.g. 
by using a 10^ order filter using S BiQuadratic filters and an artificial reverberation unit. In 
5 this embodim^it, at the encoding side, the filter is fitted and the reverberation time is 

determined. These parameters are varying slowly, thereby reducing the necessary bit rate for 
their transmission. 

The resulting filter parameters Fp are fed into an encoder 205, e.g. an encoder 
providing a Hufifinan encoding or any other suitable coding scheme, resulting in encoded 

10 filter parameters Fp©. The encoded filter parameters Fpc are fed into a combiner circuit 204. 
The arrangement fiirther comprises encoders 202 performing a proper encoding of the signal 
component Si. For example, in the case of audio signals, the signal Si maybe encoded 
according to MPEG, e.g. MPEG I layer 3 (MPS), according to sinusoidal coding (SSC), or 
audio coding schemes based on subband, parametric, or transform schemes, or any oth^ 

15 suitable schemes or combination thereof. The resulting coded signal Si,e is fed into the 
combiner circuit 204 together with the filter parameters Fp. The combiner circuit 204 
performs framing, bit-rate allocation, and lossless coding, resulting in a combined signal T to 
be communicated. 

Fig. 3 shows a schematic view of an arrangement for decoding a multichannel 
20 signal according to the first embodiment of the invention. The arrangement receives a coded 
multichannel signal T, for example originating from an encoder according to the embodiment 
described in connection of fig. 2. The arrangement comprises a circuit 301 for extracting the 
encoded signal Si^e and the encoded filter parameters Fpe Scorn the combined signal T, i.e. the 
circuit 301 performs an inverse operation of the combiner 204 of fig. 2. The filter parameters 
25 are decoded by a decoder 303 corresponding to the encoding of tiie filter parameters by the 
encoder 205 of fig. 2. The extracted signal Si,e is fed into a decoder 302 for performing audio 
decoding corresponding to the encoding performed by the encoder 202 of fig. 2, resulting in 
the decoded first signal component signal Si'. The signal Si* is fed into a filter 303 together 
with die decoded filter parameters Fp. The filter 304 generates a corresponding estimated 
30 second signal component §2*- Hence, the decoder of fig. 2 generates an output corresponding 
to the received first signal component Si' and the estimated second signal component §2'. 

Fig. 4 shows a schematic view of an arrangement 102 for encoding a stereo 
signal according to a second embodiment of the invention. The arrangement comprises 
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circuitry 401 for performing a rotation of the stereo signal in the L-R space by an angle a, 
resulting in rotated signal components y and r according to the transformation 

y = L cos a + R sin a = wl L + wrR 

r = -L sin a + R cos a = -wrL + WlR, (1) 

where WL=cosa and WR=^ina will be referred to as weighting factors. 

According to this embodiment, tiie angle a is determined such that it 
corresponds to a direction of higih signal variance. The direction of maximum signal variance, 
i.e. the principal component, may be estimated by a principal component analysis such that 
the rotated y component corresponds to the principal component signal which includes most 
of the signal energy, and r is a residual signal. Correspondingly, the arrangement of fig. 4 
comprises circuitry 400 which determines the angle a or, alternatively, the wei^t factors wl 
and wr. 

Referring to fig. 5, according to a preferred embodiment, the above weight 
factors Wl and wr are determined according the following algoriflun: 

Initially, the incoming stereo signals L and R are rectified and lowpass 
filtered, resulting in envelope signals p(k) of L and q(k) of R, respectively, where p(k) and 
q(k) are suitably sampled and the sample index is denoted k. Thus, the vector x(k) = Cp(k), 
q(k)) denotes the incoming signal vector. Alternatively, the signals L and R may be used 
directly, i.e. without filtering, or other filtered versions of L and R may be used, e.g. highpass 
filtered signals L and R, In fig. 5 a number of signal points are illustrated as circles. As an 
example, the signal point x(k) and its corresponding components p(k) and q(k) are indicated. 
According to the invention, the signals are rotated in the direction of the principal component 
of the signal vectors. In the example of fig. 5, this corresponds to the y direction where a is 
the angle between the y direction and the p direction. The weight vector w = (wl, Wr) 
indicates the direction of the principal component, and the rotated components of x(k) are 
denoted y(k) and r(k), respectively. 

The principal component may be determined by any suitable metiiod known in 
the art. In a particularly advantageous embodiment, an iterative method utilising Oja's rule 
(see e.g, S. Haykin: *TMeural Networks'*, Prentice Hall, N J., 1999) is used. According to this 
embodiment, the weight vector w is iteratively estimated according to liie following equation 
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w(k) = w(k-l) + \i [x(k-l) - w(k-l) y(k-l)], (2) 

where w(k) = ( wlOc), wrOc) ) corresponds to the estimate at time k. The above itCTation may, 
for example, be initiated witibi a set of small random weights w(0), or in any other suitable 
way. The above estimated weight vector may be used to calculate the rotated signal 
according to y(k) = w'^(k)x(k). Alternatively, the iteration of eqn. (2) may be perfomied on a 
block basis, e.g. for a block of N samples, where N depends on the particular implementation, 
for example, N=512, 1024, 2048, etc. In this embodiment, the estimated weight vector w(N) 
for a block may be used in the transformation of all samples of that block according to y(k) = 
w'^(N)x(k). 

The factor |li in eqn. (2) corresponds to a time scale of the tracking algorithm. 
If ti==0, the weighting factors and, flius, the angle a, remain constant, while they change 
rapidly for large |a. As an example, for a block size of 2048 samples, ix may be selected of the 
order of 10"^ for a sampling rate of 44.1 kHz. 

It is an advantage of the above iterative algorithm that it is linear, i.e. it does 
not require the calculation of any trigonometric functions, square roots or the like. It is a 
further advantage, that the above iteration yields a normalised weight vector w, as the term - 
p.w(k-l)y(k-l) in eqn. (2) corresponds to a weight decay temi pCTialising large weights while 
flie term + n x(k-l) drives the weight vector in the direction of the principal component It is 
further noted that in the present embodiment, since x(k) is the envelope signal, w^wr e 
[0,1], i.e. the weight vector w lies in the first quadrant in fig. 5, thereby ensuring that is 
positive. It is a further advantage of this embodiment that it suffices to transmit one of Wl and 

wr, as the other factor may be determined according to Wr = ^p- (Wl )^ . Alternatively, the 

angle a may be transmitted. 

Again referring to fig. 4, the circuit 400 outputs the determined angle a or, 
alternatively, one or both of the weight factors Wl and wr. The angle information is fed into 
the rotation ch-cuit 401 which generates the rotated signal components y and r. It is 
understood that the circuits 400 and 401 may be combined in a single circuit performing the 
iterative calculation of eqn. (2) and the calculation of y and r according to eqn. (1). 

According to this embodiment of the invention, it is recognised that the 
residual signal r may be estimated as a filtered version of the principal signal y. In an acoustic 
recording of an audio source recorded by two microphones in the absraice of acoustic 
distortions, e.g. due to reflections, etc., the principal signal y corresponds to the audio source 



wo 03/085645 PCT/IB03/01154 

13 

and the residual signal is substantially zero. For example, the stereo signals L and R may be 
expressed as 1>=M+S and R=M-S, where M corresponds to a mid or centre signal and S 
corresponds to a stereo or side signal. In the case of an acoustic recording of a stationary 
sound source, e.g. a speaker recorded by two microphones, the L and R signals are 
5 substantially equal, if the speaker is positioned exactly between the microphones and 

assuming that there are no acoustic distortions such as reflections, etc. Hence, in this case S is 
substantially zero or at least small and the coding scheme according to this embodiment 
substantially yields y corresponding to L+R and r corresponding to L-R being zero or small; 
this corresponds to a = 45 degrees. If the speaker is not positioned exactly between the 

10 microphones, i.e. there is an asymmetry, but still assuming that there are no reflections or 
other distortions, the rotated signal y according to the invention still corresponds to the 
speaker and the residual signal r is substantially zero. However, in this case the angle a 
differs from 45 degrees. 

In a more realistic situation distortions are present, e.g. due to reflections of 

15 the signal at the walls of a room and at the head and torso of the speaker, etc. These effects 
influence the residual signal r. Consequently, when estimating the residual signal by a filter, 
the filter in effect models the room acoustics, etc. For a classical orchestra the situation is 
similar, while in the case of modem pop music the situation may be slightly different. In this 
case, a sound engineer typically mixes multiple channels into two chaimels, often using 

20 artificial reverberation, effect boxes etc. In this case the filter models the acoustic effects 
introduced by the mixing process. 

Accordingly, still referring to fig. 4, the arrangement further comprises an 
adaptive filter 201 receiving the principal signal y as an input and generating a filtered 
signal P . The filter parameters Fp of fiie adaptive filter are selected such that the filtered signal 

25 f approximates the residual signal r, e.g. by controlling the adaptive filter 201 by the error 
signal e indicating the difference between r and P as generated by a subtraction circuit 203. 
The resulting filter parameters Fp are fed into an encoder 205, e.g. an encoder providing a 
Huffinan encoding or any other suitable coding scheme, resulting in encoded filter 
parameters Fpe. The encoded filter parameters Fpe are fed into a combiner circuit 204. The 

30 filter 201 may be any suitable filter known in the art. Example of such filters include a finite 
impulse response (FIR) filter or a infinite impulse response (HR) filter, adaptive or fixed, 
with the cut-off frequencies and magnitudes being fixed or tracked recursively, or the like. 
The filter maybe of any order, preferably smaller than 10. The type of the filter can be 
Butterworth, CJhebychev, or any other suitable type of filter. The arrangement further 
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comprises an encoder 202 for encoding the principal signal as described in connection with 
fig. 2, residting in the encoded principal signal which is fed into the combiner circuit 204 
together with the filter parameters Fp and the angle information a. As described in connection 
with fig, 2, the combiner circuit 204 performs firaming, bit-rate allocation, and lossless 
5 coding, resulting in a combined signal T to be connnunicated which includes the encoded 
principal signal ye, the filter parameters Fp and the angle infonnation a. In one embodiment, 
the angle a or, alternatively, wl and/or wr may be communicated as part of a header 
transmitted prior to a signal firame, a signal block, or the like. 

According to tiie invention, as the transformation angle a is tracked such that 

10 the principal component signal includes most of the signal energy, the bit rates allocated to 
the y and r signals may be selected to be different, thereby optimising the coding efficiency. 
As described above, in the example of an acoustic recording of an audio source recorded by 
two microphones in the absence of acoustic distortions, the principal signal y corresponds to 
the audio source and the residual signal is substantially zero. In this example, the angle a 

15 corresponds to the position of the soimd source relative to the microphones. If the sound 
source moves, e.g. firom left to right, the method according to the invention still yields a 
principal component signal y corresponding to the source and a small residual signal r, 
ideally being r=0. In this case, a changes form 0 (fixlly left) to 90 degrees (fiilly right). The 
above example illustrates the advantage of tracking the angle a. Hence, it is an advantage of 

20 the invention that it allows an efficient coding of stereo signals. 

According to this embodiment of the invention, the bit rate to be allocated to 
the filter parameters Fp may be considerably smaller than the bit rate necessary for the 
principal signal y, e.g. in one embodiment, the bit-rate for Fp may, on average, be less than 
10% of the bit rate for y. Hence, it is an advantage of the invention that it reduces the bit rate 

25 necessary for transmitting a stereo signal. The total bit rate according to the invention is only 
sUghtly higher than for a single mono channel. It is noted, however, that this ratio may vary 
diuing a recording. For example, the ratio may become smaller, e.g. in a situation with Uttle 
distortions and a stationary source, but also larger, e.g. if the L and R signals are momentarily 
independent. 

30 Fig. 6 shows a schematic view of an arrangement 107 for decoding a stereo 

signal according to the second embodiment of the invention. The arrangement receives a 
coded storeo signal T, for example originating fix)m an ©acoder according to the embodiment 
described in coimection with fig. 4. The arrangement comprises a circuit 301 for extracting 
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the encoded signals ye, fhe encoded filter parameters Fpe, and the angle information a from 
the combined signal T, i.e. the circuit 301 performs an inverse operation of the combiner 204 
of fig. 4. The extracted signal ye is fed into a decoder 302 for performing audio decoding 
corresponding to the encoding performed by the encoder 202 of fig. 4, resulting in the 
5 decoded principal component signal y\ The encoded filter parameters Fpe are decoded by a 
decoder 303 corresponding to the encoding of the filter parameters by the encoder 205 of fig. 
4. The signal y' is fed into a filter 304 together with the decoded filter parameters Fp, The 
filter 304 generates a corresponding estimated residual signal f ' . The received principal 
component signal y', the estimated residual signal T and the received angle information a 

10 are fed into a rotation circuit 601 which rotates the signals y% f back in the direction of the 
original L and R components, thus resulting in the received signals L' and R'. 

In the embodiment described in connection with figs. 4 and 6, the filters 201 
and 304 may be standard adaptive filters in the temporal or time domain (see e.g. "Adaptive 
Filter Theory", by S. Haykin, Prentice Hall, 2001), e.g. an adaptive filter known firom the 

15 field of echo cancellation. Other examples of filters include a fixed FIR or IIR filter with a 
fixed or adaptive cut-ofif-frequency and magnitude. Alternatively, the filter may be based on 
apsychoacoustic model of the human auditory system or another suitable filter, e.g. using a 
10^ order filter using 5 BiQuadratic filters and an artificial reverberation unit, as described in 
connection with fig. 2. 

20 Figs. 7a-c show schematic views of examples of a filter circuit for use in an 

embodiment of the invention. 

In the example of fig. 7a, the filter 201 comprises a combination of a filter 701 
and a reverberation filter 702. For example, the filter 701 may be a standard adaptive filter in 
the temporal or time domain, a fixed FIR or IIR filter with a fixed or adaptive cut-off- 

25 frequency and magnitude, etc., e.g. a high-pass filter. According to this embodiment, both the 
filter parameters of the filter 701 and the parameters of the reverberation filter 702, such as 
the reverberation time denoted Teo, are transmitted to the decoder as filter parameters Fp. 

In the example of fig. 7b, in addition to the filters 701 and 702, two control 
circuits 703-704 are added. A control circuit 703 is added to ensure that the average power of 

30 ttie residual signal r and the average power of the output of the reverberator 702 aire 
approximately the same, e.g. by multiplying the output of the reverberator 702 with a 
parameter Pi. A second control circuit 704 multiplies the scaled output of the reverberator 
with p2- The factor P2 may be selected in the range between -3dB and +6dB and it is 
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determined such that the cross correlation p between r and f is as high as possible, i.e. that 
the signals r and f are as similar as possible. Hence, the filter arrangement of fig. 7b further 
comprises a circuit 705 for determining the cross correlation p. The filter arrangement further 
comprises a multipUer 706 for generating the product p=Pip2 which is output as a part of the 
filter parameters Fp. Hence, Pi is a gain that is automatically controlled, e.g. by comparing 
the absolute mean of r and f , and P2 is another gain that is automatically controlled, e.g. by 
use of the cross-correlation coefficient p. The first gain is intended to make sure that the 
energy of r is preserved, i.e. that the energy of the predicted signal f at the receiver 
corresponds to the energy of r. The second gain is to make sure that r and P' are well 
correlated. 

In one embodiment, the reverberator 702 and the filter 701 may be fixed, i.e. 
not adJ5)ted according to the filter parameters Fp. Further, P2 may be fixed, tihiereby leaving 
the slowly varying parameter pi as the only adjqptive parameter which needs to be adjusted 
and transmitted. Consequently, a particularly simple filter arrangement is provided. It is an 
advantage of this embodiment that it only requires about half the original stereo bit rate for 
transmitting a stereo signal. It is noted that fiurther variations of the above embodiment may 
be used. For example, in one embodiment the filter 701 may be left out. 

Furthermore, altematively or additionally to the correlation p, other measures 
of correlation may be used to ensure a high degree of similarity between the original signal 
and the signal after encoding-decoding. For example, in one embodiment two correlators may 
be used instead of correlator 705. One correlator may conapute the cross-correlation plr of 
the input signals L and R. Furthermore, a second correlator may compute the cross 
correlation p'lr of the resulting ou^uts L' and R' of the encoder-decoder, i.e. accordmg to 
this embodiment, the encoder further comprises a decoder circuit for determining the signals 
U and R'. This embodiment uses the difference Sp=pLR - p'lr to control p2 such that Sp is 
minimal. This is illustrated in fig. 7c, where the correlator of fig. 7b is replaced by circuit 707 
which receives the signals L and R as well as L' and R' as inputs and generates as an output a 
signal indicative of the difference 8p. The output Sp of circuit 707 controls circuit 704 to scale 
the estimated residual f such that 8p is minimised. In one embodiment, the inputs to circuit 
707 are high-pass filtered, e.g. at 250Hz, such that the low firequaacies have a decreasing 
contribution to 8p. As in the embodiment of fig. 7b, it is an advantage of this embodiment that 
the correlation between the resulting stereo image and the original stareo image before the 
coding-decoding is very hi^. 
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Fig. 8 shows a schematic view of an arrangement for encoding a stereo signal 
according to a third embodiment of the invention. The arrang^ent is a variation of the 
embodiment described in connection with fig. 4, and it comprises circuitry 401 for 
performing a rotation of the stereo signals L and R, circuitry 400 for determining the angle of 
5 rotation, an adaptive filter 201, a subtraction circuit 203, an encoder 202, an encoder 205, and 
a combiner circuit 204, as described in connection with fig. 4. According to this embodiment, 
the principal component signal y is not directly fed into the filter 201. Instead, the 
arrangement fiirther comprises a decoder 302 as described in connection with fig. 6. The 
decoder 302 receives the encoded principal component signal ye generated by the encoder 

10 202 and generates the decoded principal signal y' which is fed into the filter 201. It is an 
advantage of this embodiment that it reduces the effect of coding errors introduced by the 
coding and decoding of the signal y. These coding errors cause the decoded signal y' to be 
slightly different fix)m the original signal y due to the fact that the decoder 302 in practice is 
not a perfect inverse of the encoder 202, i.e. EE'^ ^l. Consequently, by applying an 

1 5 encoding and decoding of the signal y at the decoder, the input y' to the filter 201 

corresponds to the input y' fed into the filter 304 (of fig. 6) at the receiver, thereby improving 
the result of the prediction of of the residual signal at the receiver. Hence, the encoder 
according to this embodiment may be used in coimection with a decoder according to the 
embodunentoffig. 6. 

20 Fig. 9 shows a schematic view of an arrangement for encoding a stereo signal 

according to a fourth embodiment of the invention. The arrangement is a variation of the 
embodiment described in connection with fig. 4, and it comprises circuitry 401 for 
performing a rotation of the stereo signals L and R, circuitry 400 for determining the angle of 
rotation, an adaptive filter 201, a subtraction circuit 203, an encoder 202, an encoder 205, and 

25 a combiner circuit 204, as described in connection with fig. 4. According to this embodiment, 
the principal component signal y is not directly fed into the filter 201 . Instead, the 
arrangement fiirther comprises a multiplication circuit 901 multiplying the residual signal r 
received firom circxiit 401 with a constant y, and an adding circuit 902 for adding the scaled 
residual signal to the principal component signal y, resulting in a signal y + y r which is fed 

30 into the filter 201 . Here, y is a small positive value, e.g. of the order of 10'^. In one 

embodiment, the constant y is tracked adaptively. It is an advantage of this embodiment that 
frequencies which are substantially not present in the spectrum of the signal y but present in 
the spectrum of r may be utilised in the modelling of the residual signal f by the filter 201, 
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thereby improving the quality of the coded signal. According to this embodinoient the signal y 
+ y r is fed into the encoder 202 which generates the decoded principal signal ye to be 
transmitted to the receiver. Finthermore, accordiag to this embodiment, the constant 7 is fed 
into the combiner 204 and transmitted to the receiver. 
5 Fig. 10 shows a schematic view of an arrangement for decoding a stereo signal 

according to the fourth embodiment of the invention, i.e. suitable for decoding a signal 
received from an encoder according to fig. 9. The arrangement comprises a circuit 301 for 
extracting the received information from the combiaed signal T, a decoder 302, a decoder 
303, a filter 304, and a rotation circuit 601 as described in connection with fig, 6. Accordiag 

10 to this embodiment, the circuit 301 ftirther extracts the constant y from the combined signal 
T, and the arrangement finiiier comprises a multiplication circuit 1001 for multiplying the 
predicted residual signal P' generated by the filter 304 with the received constant y. The 
arrangement ftirther comprises a circuit 1002 for subtracting the resulting scaled predicted 
residual signal y from the decoded principal signal y'. 

15 Fig. 1 1 shows a schematic view of an arrangement for encoding a multi- 

channel signal according to a fifth embodiment of the invention. The arrangement receives a 
multichaimel signal comprising n channels Si,. . .,Sn. The arrangement comprises a principal 
component analyser 1 100 for perfomung a principal component analysis of the signal 
componeaits Si,. . .,Sn, resulting in a weight vector w = (wi,. . .,Wn) for transforming the ii5)ut 

20 signal into a principal component signal y and n-1 residual signals ri, r2, . . rn-i- The 
arrangement ftirther comprises a transformation circuit 1 101 receiving the input signal 
components Si,. . .,Sn and the determined weight vector w, and generating the signals y and ri, 
. . rn-i according to the above transformation. The principal component signal y is fed into a 
set of adaptive filters 201, each predicting one of the residual signals ri,. . .,rn-i, as described 

25 in connection with fig. 4, resulting in corresponding filter parameters Fpi,. . ., Fp(n.i) which are 
fed into corresponding encoders 205 and, subsequently, into the combiner 204. At a 
correspondiag decoder (not shown), corresponding filters are used for generating estimates 
f\ . ., f*n_i of the residual signals based on the filter parameters, as described in connection 
with fig. 6. The arrangement fiarther comprises an encoder 202 for encoding the principal 

30 component signal y, resulting in an encoded signal y© which is also fed into the combino: 
204. 
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It is understood that, according to one embodiment, only a subset of residual 
signals, e.g. ri,. . .,rk, k<n-l, may be transmitted to the receiver or fed into corresponding 
filters, tiiereby reducing the necessary bit rate while maintaining most of the signal quality. 

Fig. 12 shows a schematic view of a subtraction circuit for use with an 
5 embodiment of the invention. In the above embodiments, the filter parameters are determined 
by comparing a target signal with an estimated signal, i.e. by the error signal e indicating the 
difference between r and P as generated by a subtraction circuit 203. It is understood that the 
subtraction circuit may generate different measures of diflference between r and f , for 
example a difference may be determined in the time domain or in tibie firequency domain. 

10 Referring to fig. 12, the circuit 203 may comprise circuits 1201 for transforming the signals r 
and f , respectively, into the firequency domain, e.g. by performing a fast FouriCT 
transformation (FFT). The resulting frequency components may be fiirther processed by 
respective circuits 1204. For example different firequencies maybe weighted differently, 
preferably according to the properties of the human auditory system, thereby weighting 

15 differences in the audible fi-equency range more strongly. Other examples of fiirther 
processing by the circuits 1204 include an averaging over predetermined fi-equency 
components, calculating the magnitude of the complex frequency components, clustering of 
filter components, or the like. For example, in a preferred embodiment, a clustering is 
performed prior to the subtraction in the frequency domain. This clustering may be 

20 performed using a filter-bank, e.g. with linear or logarithmic sub-bandwidths. Alternatively, 
the clustering may be performed using the so-called equivalent rectangular bandwidth (ERB) 
(see e.g. "An introduction to the Psychology of Hearing", by Brian Moore, Academic Press, 
London, 1997). The equivalent rectangular bandwidth technique clusters frequCTicy-bands 
that correspond to the human auditory filters, e.g. the so-called critical bands. According to 

25 this embodiment, the corresponding value of the ERB as a fimction of centre frequency, f (in 
kHz), is may be calculated accordmg to ERB = 24.7( 4.37 f + 1 ). Still referring to fig. 12, the 
circuit 203 fiirther comprises a subtraction circuit 1203 for subtracting the processed 
frequency components. Altematively, the transformed signals generated by the circuits 1201 
are directly fed into the subtraction circuit 1204 without fiirther processing. The difference 

30 signal generated by the subtraction circuit 1204 is fed into a transformation circvrit 1202 for 
transforming the error signal back into the time domain, e.g. by performing an inverse fast 
Fourier transform (IFFT). Altematively, tiie difference signal in the frequency domain may 
be used directiy. 
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It is understood that a skilled person may addpt the above embodiments, e.g. 
by adding or removing features, or by combining features of the above embodiments. For 
example, it is understood that the features introduced in embodiments of fig. 8 and 9 may be 
incorporated in the embodiment of fig. 11 as well. As another example, the error signal e 
5 describing the quality of the estimated residual signal in the embodiment of fig. 4 may be 
compared to a threshold error indicating a maximxim acceptable error. If the error is not 
acceptable, the error signal may, after suitable coding, be transmitted together with the signal 
T sinfiilar to the methods used within the field of Linear Predictive Coding (LPC). 

It is fiirther noted that the invention is not limited to stereophonic signals, but 

10 may also be applied to other multi-channel input signals having two or more input channels. 
Examples of such multi-chaimel signals include signals received firom a Digital Versatile 
Disc (DVD) or a Super Audio Compact Disc, etc. In this more general case, a principal 
component signal y and one or more residual signals r may still be generated according to the 
invention. The mmiber of residual signals transmitted depends on the niunber of channels and 

15 the desired bit rate, as higher order residuals may be omitted without significantly degrading 
the signal quality. 

In general, it is an advantage of the invention that bit-rate allocation may be 
adaptively varied, thereby allowing gracefiil degradation. For example, if the commimication 
channel momentarily only allows a reduced bit rate to be transmitted, e.g. due to increased 

20 network traffic, noise, or the Uke, the bit rate of the transmitted signal may be reduced 

without significantly degrading the perceptible quality of the signal. For example, in the case 
of a stationary sound source discussed above, fhe bit rate may be reduced by a factor of 
approximately two without significantly degrading the signal quality, corresponding to 
transmitting a single channel instead of two. 

25 It is noted that the above arrangements may be implemented as general- or 

special-purpose programmable microprocessors. Digital Signal Processors (DSP), 
Application Specific Integrated Circuits (ASIC), Programmable Logic Arrays (PLA), Field 
Programmable Gate Arrays (FPGA), special purpose electronic circuits, etc., or a 
combination thereof. 

30 It should be noted that the above-mentioned embodiments illustrate rather than 

limit the invention, and that those skilled in the art will be able to design many altemative 
embodiments without departing firom the scope of the appended claims. In the claims, any 
reference signs placed between parentheses shall not be constmed as limiting the claim. The 
word 'comprising' does not exclude the presence of other elements or steps than those listed 
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in a claim. The invention can be implemented by means of hardware comprising several 
distinct elements, and by means of a suitably programmed computer. In a device claim 
enumerating several means, several of these means can be embodied by one and the same 
item of hardware. The mere feet that certain measures are recited in mutually different 
dependent claims does not indicate that a combination of these measures cannot be used to 
advantage. 



